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Abstract 

We give an efficient algorithm to randomly generate finitely generated 
subgroups of a given size, in a finite rank free group. Here, the size of 
a subgroup is the number of vertices of its representation by a reduced 
graph such as can be obtained by the method of Stallings foldings. Our 
algorithm randomly generates a subgroup of a given size n, according to 
the uniform distribution over size n subgroups. In the process, we give 
estimates of the number of size n subgroups, of the average rank of size n 
subgroups, and of the proportion of such subgroups that have finite index. 
Our algorithm has average case complexity 0{n) in the RAM model and 
0{n^ log^ n) in the bitcost model. 
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Algorithmic problems in combinatorial group theory have been the focus of 
increased interest (see [3l [TH [151 El HSl HZl HQ] for recent examples). This is 
especially the case for the theory of free groups and the investigation of their 
finitely generated subgroups, which is the focus of this paper. 

As in other fields, the investigation of algorithmic problems and of their 
complexity brings to light interesting structural properties of the objects under 
study. One classical approach is to study the properties of random objects, here 
of random finitely generated subgroups of a free group. This naturally depends 
on the probability distribution we adopt, we come back to this below. 

The complexity of algorithms is often estimated according to the worst- 
case paradigm. It can also be estimated in average, or generically [TB]. Both 
these concepts also depend on the choice of a distribution, and can benefit 
directly from the enumeration and generation results in this paper. Random 
generation can also be useful to test conjectures or algorithms with large set 
of representative instances (again, depending on the choice of a probability 
distribution), provided that the random generation algorithm is fast enough. 

In this paper, we use the well-known fact (Stallings [26]) that every finitely 
generated subgroup of a free group F with basis A admits a unique graphi- 
cal representation of the form (F, 1), where F is a finite directed graph with 
A-labeled edges and 1 is a designated vertex of F — subject to certain combi- 
natorial conditions, see Section [TTT] for details. Then we consider the number of 
vertices of F to be a measure of the size of the subgroup represented by (F, 1). 
Note that the size of H is strongly dependent on the choice of the basis A of 
F. Very importantly also, for each n > 1, there are finitely many subgroups of 
F of size n. The probability distribution on subgroups discussed in this paper 
is the uniform distribution on the set of fixed size subgroups: if there are s(n) 
subgroups of size n, each has probability 

A large and growing number of algorithmic problems on free groups admit 
efficient solutions using these graphical representations (see [HI [151 [16l [171 [HI 
[211 [24] among others), and this further emphasizes the interest of a random 
generation scheme based on this representation. 

The main result of the paper is an efficient algorithm to randomly generate 
size n subgroups of F. Its average case complexity is 0{n) in the RAM, or unit- 
cost model, and ©(n^log^n) in the bit-cost model, see Section 1331 for a more 
precise discussion. Our algorithm actually generates graphical representations 
(F, 1) of subgroups, but we want to emphasize that, as these representations 
are in bijective correspondence with finitely generated subgroups of F, we truly 
achieve a uniform distribution of (size n) subgroups. 

The strategy followed by the algorithm is described in Section 11.21 The 
algorithm itself is actually simple and easily implementable, besides being fast. 
Its proof is more complex; it relies on the recursive nature of the combinatorial 
structures underlying graphical representations of subgroups (see Section [51 and 
in particular Section 12. ip . and we make direct use of the concepts and the 
tools of the so-called recursive method heralded by Nijenhuis and Wilf f22| and 
systematized by Flajolet, Zimmermann and van Cutsem [TTj . 



2 



In Section SI we study the distribution of the ranks of size n subgroups and 
we show that if F has rank r > 2, then the mean value of the rank of a size n 
subgroup is (r — l)n~ r^/rl+ 1, with standard deviation o{y/ri). In Section[5l we 
show how our strategy can be modified (and simphfied) to randomly generate in 
linear time size n finite index subgroups (that is, subgroups of rank (r — l)n + 1) 
— even though these groups are negligible among general size n subgroups. We 
actually give a precise estimate of the proportion of such subgroups and we 
prove that it converges to faster than the inverse of any polynomial. 

The paper closes on a short discussion of related questions, and in particular 
of the comparison of our distribution with that which is induced by the random 
generation of an n-tuple of words and the consideration of the subgroup they 
generate, see [Ml [191 HQ] for instance. 

Throughout this paper, we denote by \X\ the cardinality of a set X, and by 
|1, n] the set {1, . . . , n} (where n is a positive integer). 

1 General notions and generation strategy 

1.1 Graphical representation of subgroups 

Let F be a free group with finite rank r > 2 and let A be a fixed basis of F. We 
sometimes write F — F{A) and the elements of F are naturally represented as 
reduced words over the alphabet AU A~^. It is well-known that the subgroups 
of F are free as well. Moreover, each finitely generated subgroup of F can be 
represented uniquely by a finite graph of a particular type, by means of the 
technique known as Stallings foldings [26j (see also [281 [HI [27] ) . We refer the 
reader to the literature for a description of this very fruitful technique, and we 
only record here the results that will be useful for our purpose. 

An A-graph is defined to be a pair F = {V, E) with E <Z V x A x V , such 
that 

• if {u, a, (u, a, v') £ E, then v = v'; 

• if (m, a, v), (m'j a, v) G E, then u = u' . 

The elements of V are called the vertices of F, the elements of E are its edges, 
and we sometimes write V{T) for V and E{V) for E. We say that F is connected 
if the underlying undirected graph is connected. If u € V{T), we say that v is 
a leaf if v occurs at most once in (the list of triples defining) E{T) and we say 
that F is v-trim if no vertex w ^ v is a, leaf. Finally we say that the pair (F, v) 
is admissible if F is a w-trim and connected yl-graph. 
Then it is known that: 

• Stallings foldings associate with each finitely generated subgroup H of 
F{A) a unique admissible pair of the form (F, 1), which we call the graph- 
ical representation of H in this paper |261 1281 115] ; 
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• every admissible pair (F, 1) is the graphical representation of a unique 
finitely generated subgroup of F{A) [26 l [28l fT5] : 

• if iif is given by a finite set of generators (in the form of reduced words 
over A U A~^) of total length n, then the graphical representation of H 
can be computed in time 0{n\og* n) [77] ; 

• if (r, 1) is the graphical representation of H, then rank(i7) = |-E'(r)| — 

\v{T)\ + i 

• if (r, 1) is the graphical representation of H, then H has finite index if and 
only if for each v e V(T) and for each a A, there is an edge of the form 
iv,a,w) G E{T) [MIIMIIIS], if and only if rank(i7) = (|^| - l)|T^(r)| + 1. 

We sometimes identify H and its graphical representation (F, 1) — for in- 
stance when we say that we randomly generate subgroups of F: what we gener- 
ate is actually the graphical representation of such subgroups. As explained in 
the introduction, we consider the number of vertices to be a measure of the size 
of r and we write \H\ = \T\ = \V\. In particular, F has finitely many subgroups 
of size n. 

1.2 Enumeration and random generation 

As we shall see, A-graphs fall in the category of decomposable structures, that 
is, structures that can be built from unit elements and from operations such as 
the union, direct product, set formation, etc. We will use the so-called recursive 
method to enumerate and to randomly generate such structures Details 
are given further in the paper, concerning the enumeration (Section [2]) and the 
random generation algorithm and its complexity (Section [3]). At this point, 
let us simply say that the random generation of size n A-graphs requires a 
pre-computation phase in 0(n), after which each draw takes time 0{n). 
The rest of this section is devoted to an overview of our strategy. 

Remark 1.1 There exists another method than the recursive method, to de- 
rive a random generation algorithm from a combinatorial specification, this time 
according to a BoUzmann distribution. Recall that, in such a distribution, an 
object 7 receives a probability essentially proportional to an exponential of its 
size |7|. (More precisely this probability depends upon a positive real parameter 
X, and it is proportional to x''''! when 7 is an unlabeled structure and to a;''''' 
when 7 is labeled; see Section 11.2.11 below about labeled vs. unlabeled struc- 
tures.) In particular, Boltzmann samplers do not generate objects of a fixed 
size. They depend on the real parameter a; > and, for any given integer n, the 
value of X can be chosen such that the average size of the generated elements is 
n. Even though the size of the objects generated is not fixed, Boltzmann sam- 
plers guarantee that two elements of the same size have the same probability to 
be generated. 

A method to systematically produce Boltzmann samplers was recently in- 
troduced by Duchon, Flajolet, Louchard and SchaefFer [7] for labeled structures 
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(Flajolet, Fusy and Pivoteau for unlabeled structures in [S]). The evaluation of 
X is the only required precomputation and the complexity of generation itself 
is linear as long as small variations in size are allowed. This approach can also 
be used for exact-size generation, but in the case of ^-graphs it is less efficient 
than the recursive method (see Remark l2.3|) . □ 



1.2.1 We count labeled A-graphs 

Enumeration for us, is the enumeration of structures up to isomorphism. The 
structures which we want to generate are admissible pairs (F, 1), that is, A- 
graphs with one vertex labeled 1, that are connected and 1-trim. We later use 
the phrase admissible A-graphs. Leaving aside for a moment the properties 
of connectedness and 1-trimncss, we are interested in A-graphs with a distin- 
guished vertex. This is an intermediary situation between labeled and unlabeled 
structures, which are two great categories of structures for which there exist a 
large toolkit for enumeration and random generation [TTl [lOl [71 [8] . 

An A-graph F = {V, E) of size n is said to be labeled if it is equipped with 
a bijection A: V. Of course, there are n\ different such bijections, but 

some of them may yield isomorphic labeled A-graphs. For instance, \i E — % 
(so that F consists of n isolated vertices), all labelings of F are isomorphic. 




Fi F2 F3 r4 

Figure 1: Four A-graphs with different numbers of non- isomorphic labelings 



Example 1.2 Consider the A-graphs in Figure[T] Then all labelings of Fi are 
isomorphic but F2 has 2 non-isomorphic labelings. How many non-isomorphic 
labelings does F3 have? And F4? □ 

Counting structures such as A-graphs can lead to complex considerations 
involving, say, the automorphism groups of each of the connected components 
of the A-graphs, but standard methods of analytic combinatorics actually solve 
this enumeration problem for us, see Section fl. 2. 21 These methods rely on the 
use of labeled structures, and we will therefore enumerate and generate labeled 
A-graphs. Why this is justified even for the purpose of randomly generating 
admissible pairs is discussed in Section [l.2.3l 
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1.2.2 Generating series 

Let ^ be a class of combinatorial structures. If A has a„ elements of size n, the 
formal power series ^„ a„2;" and J2n called respectively the ( ordinary) 

generating series and the exponential generating series (EGS) of the class A. 

As it turns out, certain operations on classes of labeled combinatorial struc- 
tures have a direct translation over their EGS. For instance, suppose that A is 
the union of the disjoint classes B and C (that is, a size n element of ^ is a size 
n element of either ;B or C), and let A{z), B{z) and C{z) be the EGS of the 
three classes. It is immediate that A{z) = B{z) + C{z). 

For more complex operations, one needs to handle the question of relabelings. 
If 5* is a size n structure with a labeling function A, we say that /i is an expansion 
of A if the domain of fj, is of the form {fci, . . . , kn} C N with fci < • • • < A:„ and 
/Li(/ci) = X{i) for each i. If Si,...,Sr are structures of size ni,...,nr with 
labeling functions Ai, . . . , A^, then the sequence S = (S'l, . . . , Sr) is a structure 
of size n = and we say that a labeling A of is compatible with the Ai if 

it is obtained by the combination of expansions of the Ai, whose domains form 
a partition of In particular, A is not uniquely determined by the A^. 




Fi r2 a labeling of (Fi, r2). . . and another 

Figure 2: Two labeled A- graphs and two compatible labelings of the sequence 
they compose 

Example 1.3 Figure [2] shows two labeled v4-graphs of sizes 2 and 3, and several 
compatible labelings of the sequence they compose. □ 

We record here three such operations, that will be important for our purpose, 
and we refer the readers to [Hllinj for the proof of this important result. 

Proposition 1.4 Let A be a class of labeled structures with EGS A{z). 

• Let B be the class of sequences of structures from class A - that is, a 
size n labeled structure in B is a tuple {Si, . . . , Sr) of labeled structures in 
A (each Si of size Ui with n = equipped with a labeling function 

compatible with the labelings of the St . Then the EGS of B is 

^^^^ ^ T^y 
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• Let C be the class of cycles of structures from class A, where a cycle of 
structures is an equivalence class of sequences of structures in A, under 
the relation which identifies a sequence with its cyclic permutations. Then 
the EGS ofC is 

• Let T) be the class of sets of structures from class A, where a set of struc- 
tures is an equivalence class of sequences of structures in A, under the 
relation which identifies a sequence with its permutations. Then the EGS 
ofV is 

D{z) = exp 

Let us first apply this calculus to a simple example. Let A be the class 
consisting of a single graph, with one vertex and no edges (the vertex being 
necessarily labeled 1). Its EGS is A{z) = z. The class of labeled sequences of 
structures in A is in bijection with the class of labeled line graphs of the form 
with {?!,... = and its EGS is, according to Proposition 11.41 equal 

(£) — K^s) — *^ — *- ■■■ © 

to = ^„ z". This corresponds to the fact that the number of such size n 
sequences is n\ (so that its quotient by n\ is 1). 

Similarly, the EGS of the class of cycles of structures in A^ that is, the class 
of labeled cyclic graphs (such as the graphs in Example 11.21 that are labeled 

with a single letter) is log ( irri) • 

We extend these examples in Section 12.11 to compute the EGS of labeled 
yl-graphs. 

1.2.3 A rejection algorithm 

The enumeration and random generation of labeled ^-graphs is however not 
our objective. We want to generate admissible pairs, that is, A-graphs with a 
distinguished vertex 1, that are connected and 1-trim. This will be achieved by 
a rejection algorithm (see [5]). 

Suppose one wants to draw a number between 1 and 5, using a dice. It is 
natural to throw the dice repeatedly, until the result is different from 6. This is 
a semi-algorithm, since in the worst case it may never end — if we draw only 
6's — , but we will loosely call it an algorithm. 

Formally, suppose we want to generate elements of a set X, according to 
a probability law px. Suppose that X is a subset of F, and that we have a 
probability law py on Y , whose restriction to X is px- If we have an algorithm 
to generate elements of Y according to pY^ we may use this algorithm to generate 
elements of X as follows: repeatedly draw an element of Y , reject it if it is not 
in X, stop if it is in X. The average complexity of such an algorithm depends 
on the complexity of the generating algorithm on Y , on the complexity of the 
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test whether an element of Y is in X, and on the average number of rejects. 
One can show that if py{X) is the probabihty for an element of Y to be in X, 
the average number of rejects is 1/py{X). 

Concretely, we will show that the probability p„ for a size n labeled j4-graph 
to be connected and 1-trim (rigorously: A(l)-trim) tends to 1 when n tends to 
infinity (Sect ion 1 2. 2 1 and 1^75)) . This justifies the use of a rejection algorithm since 
the average number of rejects tends to when n tends to infinity. 

There remains one problem: such a rejection algorithm will generate labeled 
connected 1-trim A-graphs, and we are interested only in the information con- 
tained in (r, A(l)) - that is, we do not care how the n — 1 vertices different from 
A(l) are labeled. In other words, an admissible pair, obtained from a labeled A- 
graph by forgetting the labeling of the vertices numbered 2 to n, will be counted 
several times. The following lemma shows that this is not an obstacle. 

Lemma 1.5 Let T be an A-graph of size n and let v he a vertex ofT. IfT is 
connected, then there are (n — 1)! isomorphism classes of labeled structures on 
T such that vertex v is labeled 1. 

Proof. Let A = {a \ a G A} he a disjoint copy of A, and let F be obtained 
from r by adding, for each edge (a;, a, y), a new edge (y, a, x). It is immediately 
verified that F is an {A U A)-graph. For each vertex w v, let he a. finite 
word on the alphabet AU A labeling a path in F from v to w. Then w is the 
unique vertex accessible from v following a path labeled w^, and the words 
are pairwise distinct. This observation guarantees that distinct labelings of F 
mapping 1 to u, are non isomorphic. □ 

Thus each size n admissible pair (F, 1) is counted the same number of times, 
namely (n — 1)! times. Therefore, applying a rejection algorithm that randomly 
generates size n labeled connected 1-trim A-graphs and forgetting labels 2 to n 
also guarantees a random generation of admissible pairs for the uniform distri- 
bution on all admissible pairs of size n. 

Summarizing the algorithmic strategy, we will randomly and equally likely 
generate a labeled A-graph, reject it if it is not connected and 1-trim, draw 
another labeled A-graph, etc, until we draw a connected 1-trim labeled A-graph. 
We then ignore the labeling of the vertices numbered 2 to n. Details on the 
algorithm and its complexity are discussed in Section [31 

For convenience, we will call a labeled A-graph F admissible if the pair (F, 1) 
is admissible in the sense of Section 11.11 

2 Enumeration of 74-graphs 

We first observe that in an A-graph F = (V, E), for each a G A, the edges in E 
of the form (x, a, y) can be interpreted as the description of a partial injection 
from V to V (partial means that the domain of this injection is a subset of V). 
If the A-graph is labeled and has size n, each letter can therefore be interpreted 
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as a partial injection from |l,n] to itself. The labeled A-graph itself can then 
be seen as an ^-tuple of partial injections. 

In this section, we discuss the enumeration of partial injections on |1, n], the 
probability for an A-tuple of such partial injections to yield a labeled connected 
A-graph, the probability for that graph to be 1-trim, and finally the number of 
size n subgroups. 

2.1 Partial injections and y4-graphs 

Each partial injection is a set of disjoint cycles and non-empty sequences (in 
analogy to the decomposition of a permutation as a union of cycles). The 
EGS for cycles is \og{j^), and that for sequences is (Section ll.2.2p . It 
follows that the EGS for non-empty sequences is — 1 = jzr^i ^ind the EGS 
for the union of the (disjoint) classes of cycles and non-empty sequences is 
+ ^og{j^). Then Proposition 1 1 .41 shows that the EGS for partial injections 

is 

'^'^ = + (y^)) = exp . 

Let /„ be the number of partial injections from to itself, so that I{z) = 

Z^n n! ^ ■ 

Remark 2.1 The series I{z) turns out to be also the (ordinary) generating se- 
ries of the average number of increasing subsequences in a random permutation. 
The first values of the sequence (/n)n>o, referenced EIS A002720 in [25] are 

1, 2, 7, 34, 209, 1546, 13327, 130922, 1441729, 17572114, 234662231,... 

□ 

The above expression of I{z) yields a simple recurrence relation for the 
sequence (/„)„. Indeed, we find that the series I'{z) = X)n>o "^T""^" equal to 

= (T^ (l^) = 11^ 

Thus (1 — z)'^I'{z) = (2 — z)I(z) and the following recurrence relation follows: 

Vn>2, /„ = 2n/„_i~(n-l)'/„_2, (1) 
with Iq — I and Ii = 2. 

Lemma 2.2 For each integer n > I, we have 

{n + l)l < (7i + l)/„_i < /„ < nei/^/„_i < n! e^v^"^ 
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Proof. We proceed by induction on n, noting that Iq = 1. The inequahty 
(n + l)J„_i < /„ is verified for n — 1 and n = 2. Suppose that n > 2 and 
(n + l)In-i < In- By the recurrence relation, we have 

/„+i = (2n + 2)/„ -n2/„_i (2n + 2)/„ - (n + l)2/„_i + (2n + l)/„_i 

> (2n + 2)/„-(n+l)/„ + (2n+l)/„_i 

> (n+l)/„ + 2n/„„i 

> (n +!)/„ + /„ = (n + 2)/„, 

with the last inequality derived from the recurrence relation on the /„. Thus, for 
eachn > 1, we have /„ > (n+l)/„_i. The inequality (n+1)! < (n+l)/„_i < /„ 
follows immediately. 

For ri > 1 let m„ = proceed by induction on n to prove that 

Un < ne^/^, noting that ui = 2 < e. From Equation ([T]) 

2 

u„+i = 2{n + 1) - — < 2(n + 1) - ne'^/^ 

It remains to show that 

2(n + 1) - ne-i/^ < (n + 1)6^/^'^. 

or 

2(n + 1) < ne^i/^ + (n + l)ei/^^. 



For any real number x, 



^2 ^3 

e"> 1 + 2;+ — + — . 

2 



Therefore 

ne-i/^ + (n + 1)6^/^'^ > 2{n + 1) + \^H~T 



6\/?i + 1 6^/n 



and we want to show that 



1 1 , 7 r- V^-Vn+T 



Vn + 1 - + — ^= - = VrTTT - + \ =- > 0, 

or equivalently, 



6(71 + l)Vn - 6nVn + l + \/n-\/n+l > 



(6n + 7)V« > (6n+ l)Vn+ 1 
(6n + 7)^n > (6n + l)2(n + 1). 

Now the difference ((6n + 7)2n) - ((6?! + l)^{n + 1)) is equal to SGti^ + 36?! - 1, 
which is positive for all n > 1. This completes the proof that m„ < ne^/^ for 
all n> 1. 
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Consequently /„ < ne-^/^J„_i and 

As the function x i-^ is decreasing on the positive domain, we find that 
Vi+l — •'t Vx — ' 

Thus e^'=i ^ < e^^"^, which concludes the proof. □ 

Remark 2.3 The computation of I{z) allows us to justify our assertion that 
Boltzmann samplers are less efficient than the random generation based on the 
recursive method, see Remark 11.11 More precisely, the behavior of Boltzmann 
samplers is often such that the size of the generated object sits between (1 — e)n 
and (1 + e)n with high probability. In our case, it is essential that the tuple of 
partial injections we generate all have the same size. It is often the case that 
a Boltzmann sampler can be used to produce an exact-size sampler, using a 
rejection algorithm. In the case of partial injections however, as the distribu- 
tion of the sizes of partial injections is not sufficiently concentrated around the 
mean size, each draw of a random partial injection of size exactly n takes times 
©(n^/^), which is not very satisfactory. Here is why. 

The mean size of a partial injection produced under the exponential Boltz- 
mann model is (see 7, Proposition 1]): 

^ , . ^ r(x) 2-x 

Ea;(size of a partial mjection) = x- ^ 



I{x) (1 - x) 
and its variance is 

d 



cr^ (size of a partial injection) = —Ex (size of a partial injection) = --:7. 

ax ' (1 — xy'^ 

Thus, to generate partial injections of expected size Ej; = n, one has to choose 
X = 1 - 1/V" + 1- In this case , al = 2(n -I- if/"^. From [7, Theorem 4] 
dealing with H-admissible generating functions (see Section I2.3.2|) the exact- 
size generation requires y/QjTCJx = 0{n^^'^) rejections in average and the overall 
cost of exact-size sampling is 0{nax) — 0{rJ^^) in average. □ 

As discussed at the beginning of this section, if r = |^|, then a labeled ^-graph 
of size n can be assimilated to a r-tuple of partial injections on |l,7i], so the 
EGS of labeled A-graphs is 

rr 

E n n 

n>0 
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2.2 Connectedness 

Recall that an A-grapli is connected if the underlying undirected graph is con- 
nected. In this section, we show the following result. 

Theorem 2.4 Let A be an alphabet of cardinality r > 2 and let pn be the 
probability for an n-vertex labeled A-graph to be connected. Then lim„H^oo Pn = ^ 
and more precisely, pn — I — ^ o(_L_)_ 

In particular, this shows that labeled A-graphs are asymptotically connected. 
The proof of Theorem 12.41 given below, relies on the following theorem, due to 
Bender (see [U p. 497] for a survey and 2J for a complete proof). 

Theorem 2.5 Let F(z,y) be a two-variable real function which is analytic at 
(0,0). Let J{z) = J2n>ajnz'\ C{z) = E„>oC«^" and D{z) = E„>o'^»^" 
functions such that 

dF 

C(z) = Fiz,J(z)) and D(z) = —(z,J(z)). 

dy 



If the sequence {jn)n>o satisfies jn-i — o{jn), and if for some s > 1, we have 
Y2=l \jk3n-k\ = 0{jn-s), then 

s-1 



C„ = ^ dkjn-k + O (jV, 



fc=0 



Proof of Theorem [2T4l Let J(z) = X]n>iJ"'^" where jn = Then the 
EGS of labeled A-graphs is 1 + J(z). 

Decomposing these graphs into their connected components (connected com- 
ponents of the underlying undirected graph) and using Proposition II. 4[ we find 
that 1 + J{z) ~ exp(C(z)), where C{z) = X]fe>i '^kz'', with Ck = ^ and Ck is 
the number of connected labeled ^-graphs with k vertices. 

It follows that C{z) = log(l + J{z)). We note that the map F{z,y) = 
log(l -I- y) is analytic at (0, 0) and we let 

dF 1 
D{z)^—{z,J{z)) 



dy' ' ' " l + Jiz)- 
By Lemma [121 ^n-i ^ "^i+i' therefore 

jr jr jr 

j , ^ ^"-1 < tn < ^" 

^ {n - 1)! - {n + l)''(n - 1)! " n"-^ nl 



In particular, we have 



Jn-l^O[^]^oUn). (2) 
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We now want to verify whether J2k^l Jkjn-k ~ C(jri-s) (this is the last 
hypothesis of Theorem 12.51 for a fixed s > 1). Let S be the sum above. By 
symmetry, we get 

" rr jr I L"/2J jr 

^n-k ^ ^ I i-k n-A 



2-^ h\ (t, - h\\ - h\ 



k—s 



ki {n-ky. - I kl {n~ky. 



We show that for n large enough, the finite sequence ( ^ /Imi 1 is 
decreasing. From Equation ([1]), we have 

= 2{k + l)-k'^-^ < 2(fc+l); 

Jfc ^k 

and by Lemma 12.21 we have ^"/'''fc'^' — n~k+i ■ Therefore 

Ik+Jn-ik+l) ^ 2'^(fc + l)'- 



iu:-k - {n-k + iy 

Moreover jj^^ = f:pf> and it follows that 

^fe+i-^,';-(fe+i) k\{n-ky. ^ 2'^(fc + l)'' n-k 



2''(fc + l)''-i n-k 



< 



< 



{n-k + n-k + 1 

2''{k + iy-^ 
{n - k + 1) 



r-l ■ 



This value is less than or equal to 1 when ^^^^^ is less than or equal to c = 
2~^. Since r > 2, we have j < c < ^, and for any k < ^'l^+i < i < c. 

For any n > 18, [^J < thus the sequence is decreasing on the domain 

s<fc< LtJ- 



-6 

L"/6J 



jr jr jr jr 

^^^(n-fc)!fc! - ^^ '^ '{s + iy [n-s-iy 



FromLemmalia I^^s-i < in-s+iy ^n-s and /J+i < [s + lYe^/^Il. There- 
fore 



(n- s- 1)! (s + 1)! - {s + iy. [n- s-iy. {n- s + iy 



< 



{n ~ s)! s! (n - s + l)''-i 
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and we have 

L"/6J 



in-k II ^ las ^ ^i-s /r(s + ir-ie'-/v^ 



^_ (n - fc)! fc! - / J >(n^s)\ s!(n-s + l)'-i ' 

which is O ( (^^""*'^"^') ■ Note that it is O ( (^%) when r = 2. 

We now study the remaining part of the sum S. It foUows from Lemma 12.21 
that if fc > s, then /„-fe < ("ZltZijji -^w-a; so we have 

Il-k < (n-(fc-l))r-i(n-fc + l) 



(n-A:)! " (n - (s - l))!'-i (n - s + 1) (n - s)! 



(n - s + 1) (n - s)!' 



By Lemma [2.21 again, we have Ik < k\ and hence 

fc! ~ 



It follows that 

1=0 

Note that for s = 1, UtZain - iy-^ ^ 1 

/r, (rt-A 



{n~k)\k\-\k) n-s + l Ai*^ ^ {n-s)\ 



Now observe that for each s < k < n/2, Hi^o ~ ^ < 



We get 

L«/2J ™ L"/2J , .-(r-1) 

_V-^£fc ^ s(r-l) Ai-s (2Vfe-l)r 

^ (n-fc)! fc! - (n-s)! ^ U/ 

fe=[n/6j+l ^ ^ ^ ^ fe=Ln/6j + l ^ ^ 

For any fc such that [n/6j +1 < fc < [n/2j we have (^) > (|^„"gj)- Using Stirling 
formula (n! ~ V27re^"n"+^) we get 

Ln/6j+l/2 . I ,„| X n-L«/6j+l/2 



'27rn 



n 



Since < i and < 1, there exists < C < 1 such that (, " ,) < 

C" for n large enough. Hence, 

E ^ ^ ^ ^ 

fc=[n/6j+l 



3 VbV6j 



n\ (2x/fc-l)r < f \ ~ ' (2yiV2T-l)r 



- 3 
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In particular, for any D such that C < D < 1 we have 

L"/2J . s -(r-i) 

fc=[n/6j+l 

Consequently 

T^^^TT 5 = . O f where < < 1 

^ (n - fc)! fc! (n - s)! V / 

Finally we find that 

^/c!(n-A:)! (n-s)!Vs! V / V / 

(n — s)\ 

Hence '^^=1 ^kiii-k = ^On-s)- Thus we can apply Theorem 12.51 for any fixed 
positive integer s, it yields 



C„ = ^ dkjn-k + O (jn-s) ■ 



fc=0 



Since do = 1 and di — —ji = — /[ = —2'", we get that 

Cn = jn - 2''j„_i + O (j„_2) • 

Now Equation ([2]) above yields 



and the independent technical Proposition 12.101 below yields 



n\ 20? 
Therefore 

-1/4 



T p-1/2 

n-i/Vv^(l + o(l)). 



and 



^ = i f 1 - " e^v^(^-) (1 + 0(1)) = i(l + 0(1)) 

%i = ^(1 + 0(1)). 
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Finally 



Jn^l = T^^Ty = ^ ^ (1 + 0(1)) - ^ (1 + 0(1)). 

(n — 1)1 n\ rv ^ rv ^ 



We conclude that c„ — j„ (l — + o (— F^-)) or 

Recall that we denote by p„ the probability for an n- vertex graph whose tran- 
sitions are defined by an r-tuple of partial injections on |1, n] to be connected. 
Equation [3] shows that 

^ Tr ^ „r-l + 



which concludes the proof of Theorem 12.41 □ 



2.3 1-trimness 

Recall that for a labeled ^-graph T to be admissible, F must be 1-trim, that 
is, no vertex v 1 may be a leaf (see Section [1.2.31 and also Section [l.ip . 
Moreover, vertex u is a leaf if and only if it has less than 2 images or preimages 
in the \A\ partial injections that define F. 

In this section, we show the following result. 

Theorem 2.6 Let A be an alphabet of cardinality at least 2. The probability 
for an n-vertex labeled A-graph to have no leaf is 1 + o(l). 

We immediately record the following corollary, which gives our random gen- 
eration strategy its final justification. 

Corollary 2.7 Let A be an alphabet of cardinality at least 2. The probability 
for a given size n labeled A-graph F to be admissible, is I + o(l). 

Proof. In view of Theorems 12.41 and 12.61 an n-vertex labeled A-graph is con- 
nected with probability 1-1-0 (^), and without leaves (in particular: 1-trim) 
with probability 1 + o(l). It follows that the probability for an n-vertex labeled 
A-graph to be connected and without leaves is 

(^l + o(^i)) (1 + 0(1)) = l + o(l), 

and the result follows. □ 

The rest of this section is devoted to the proof of Theorem 12.61 We first 
observe that it suffices to establish this result when \A\ = 2: indeed, ii A = 
{oi, . . . , Or} with r > 2 and if the {ai, a2}-part of an A-graph has no leaf, then 
neither does the whole A-graph. So we assume that r ~ 2 for the rest of this 
section. 
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2.3.1 Number of sequences 



We note that a leaf in an ^-graph forms a length 1 sequence in the functional 
graph of one of the partial injections, and is an endpoint of a sequence in the 
functional graph of the other. 

We start with a study of the parameter X„, which counts the number of se- 
quences in the functional graph of a partial injection on [1, n]. We will compute 
the expectation and the variance of Xn in Section 12.3.31 Note that the number 
of endpoints of sequences in the functional graph of a random partial injection 
is bounded above by the quantity accounted for by the parameter 2Xn. 

We first introduce the bivariate series J{z,u) = ^ where Jn,k 

denotes the number of partial injections on |l,n], whose functional graph has 
k sequences (that is, such that X„ = k). The EGS of non-empty sequences was 
already computed and it is equal to or, in this multivariate setting, 
The EGS of cycles is log (also in this multivariate setting). The multivariate 
analogue of Proposition !! .41 fsee Flajolet and Sedgewick [101 ex. 7, section III. 3]) 
then shows that 

J(z, u) = exp ( -I- log I — ] ] = — exp 



1-z °\l-zJJ 

In particular, J(z, 1) = I{z). 

The expectation of variables Xn and X"^ are 



1-Z 



1 - Z 



E(X„) 



and E(X,t) 



Now J2k •^n,k is the coefficient of z" in J(z, 1) and kJ^^k is the coefficient of 
^. The coefficient of z" in -^J{z, u)\^_^ is k{k — l)Jn,k- 

1 s 1 



m£j{z,u) 
Let 



Hpiz) 

Then we have 

J(z,l) = 

— J{z,u) 
ou 



(1 - z)P 



I{z) = 



(1 - 



■ exp 



1 



I{z) - Ho{z), 



{l-z)- 

{i-zY 



■ exp 



■ exp 



(l-z) 



/(z) = ziJi(z), 

/(z) = Z^H2{Z). 



For convenience, if S{z) is a formal power series, we let [z"]S'(z) denote the 
coefficient of z" in S{z). Then the expectation of variables X„ and X"^ are 
given by 



E(X2) 



[z"]zi7i(z) _ [z"-i]i?i(z) 



[z"]/(z) 

[Z"]z2g2(2) 

[z"]/(z) 



[z"]i7o(^) 
[z"]zi?i(z) 



]i?2(^) 



[z«]i7o(z) [2"]i^o(^) 



(4) 



(5) 
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The variance of X„ is 

a\X„) = E(X2)-E(X„)2. (6) 

Thus finding asymptotic estimates of E(X„) and (T^(X„) requires finding 
estimates of the coefficients of the functions Hp{z) for p — 0,1, 2. 



2.3.2 Saddlepoint asymptotics 

Saddlepoint analysis is a powerful method to find asymptotic estimates of the 
coefficients of analytic functions which exhibit exponential-type growth in the 
neighborhood of their singularities. We refer the reader to the books by Flajolet 
and Sedgewick ^ and |10t Chap. VIII], and to the survey by Odlyzko ^ for 
a thorough presentation of saddlepoint analysis. 

The fast growth of the coefficients of Hp{z) justifies the application of sad- 
dlepoint analysis. The theorem we want to use, Theorem 12.91 below, requires 
an additional hypothesis, namely the H- admissibility of the functions Hp [lOl 
Section VIII. 5]. We now verify that this rather technical condition is satisfied. 

Let f{z) be a function that is analytic at the origin, with radius of conver- 
gence p, positive on ]0,p[- Put f{z) into its exponential form f{z) — e'*^^-' and 
let 

a(r) — rh\r) and 6(r) = r^h (r) +rh'{r). 

The function f(z) is said to be H-admissible if there exists a function 6: ]0, p[ — > 
]0, 7r[ such that the following three conditions hold: 

(HI) linv^p6(r) = +oo. 

(H2) Uniformly for \0\ < S{r) 

f{re'% - f{r)e'^<^^''^~¥^b(r) ^^^^ ^ ^^^^^ ^ 

[That is, /(re*^) = /(r)e^^'^('')-5«'''('-)(l -|- 7(r, 6*)) with |7(r,6')| < ^(r) 
when \9\ < 6{r) and \iuir^pj{r) = 0.] 

(113) and uniformly for 5[r) < |0| < tt 

/(re*^)-\/6(r) = o{f{r)) when r tends to p. 



Lemma 2.8 The functions Hq(z), Hi{z) and H2{z) are H-admissible. 

Proof. First it is elementary to verify that Hp{z) — ^^P (li) ^ ~ 

0, 1, 2) is analytic at the origin, with radius of convergence p = 1, and is positive 
on the real segment ]0, p[. Following the definition of H-admissibility above, we 
find that Hp{z) = e''p(^) with hp{z) ^ - {p + 1) log(l - z), so that 
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Therefore Condition (HI) is satisfied. 

Let S{r) = (1 — 7-)^^/^^ (for a discussion on the choice of S, we refer the 
readers to [THl Chap. VIII]). For 6 small enough, one can expand hp{re^^) into 



ftp(re'^) = hp{r) + ^ a™( 



{iey 



to! 



with am{r) = r-^am-i{r) and ao{r) — hp{r) (by definition of a Taylor devel- 
opment). In particular, we find 

ai{r) = ap(r), 

a2(f) = bp{r), and 

a3(r) - ■^^-^((2+p)+4r-pr2). 

Thus, as r tends towards 1, for \9\ < 6{r), az{r)e'^ = O ((1 - r)!/") = o(l). 
More generally, amir)0"^ = O ((1 — 7')i/'"+i) = o(Q!m_i(r)). Therefore, uni- 
formly for \6\ < S{r) 

hp{re^) = hp{r) -f iea{r) - ]^9%{r) + o(l) 

and Condition (H2) follows. 
Finally, we have 



(1 + r2 - 2rcos6')(P+i)/2' Vl + - 2rcos6'^ 

We observe that for r > 0, r(cos6' — r) and (1 -I- — 2rcos6')^^ are decreasing 
functions of on ]0,7r[. Thus, for each r > and 5{r) < \9\ < tt, \Hp{re^^)\ is 
bounded above by |i?p(re"'^''')|, namely 



r(cos 6 — r) 



r(cos(l— r) 12 — r) 

exp — TT 



li? (re*(l"'^)^)| = Vl+r^-2rcos(l-r)i2 

^ (l + r2-2rcos(l-r)Ti)(p+i)/2 

In the neighborhood of 0, cos 6 = 1 — ^6"^ + 0{6'^) . So when r tends towards 
1, we have 

cos(l-r)iVi2 ^ l-i(l-r)i^/'^ + 0((l-r)i7/3^ , 

l + r^ -2rcos(l-r)"/i2 ^ 1 + - 2r + r(l - r)!^/*^ O ((1 - r)"/^) 

= (1 - r)2 (l + r(l - rf/' + O ((1 - r)"/=^)) , 



(1-r) 



-2 



1 + r2 - 2rcos(l - r)iVi2 1 + r(l - r)5/6 + ((i _ r)ii/3) 

= (1 - r)-2 (i _ r{l - r)^/« + O ((1 - r)^/^)) 
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Moreover 

r(cos(l — r)T5 — r) ^ f ^ 



(^1 - r - i(l - r)"/6 ^ O ((1 - 



r(l - 


- rf^^ ^ 






-(1 


_ ^)5/6 _ 




-rf/-^))). 



1 + r2 - 2r cos(l - r) (1 - r)2 

(l - r(l - rf/^ + C ((1 - 
= ^(l-i(l-r)"/6+o((i_,)i4/3)^ 

(l - r(l - r)5/6 + O ((1 - r)5/3)) 
= ^(l-(l-r)5/6+o((l-r)5/3)) . 

It follows that 

|ifp(re'(i-'^)"'")| = {l~r)-^P+ 

exp(- 

As a result, 

\Hp{re''^)\ < (l-r)"(f+i)(l + 0(l-r)^/6) 

exp " " ^^^^^ + ^ ((1 - ^)"^^^) ) 

\Hp{re^')\ < (^i?p(r)+exp(^^^0((l-r)t-(f+i))^ 

expf-— ^ + 0((l-r)i) 
V {l-r)e 

and hence 

that is, Condition (H3) is satisfied. □ 

We now want to use the following theorem [TUl Theorem VIII. 5]. 

Theorem 2.9 (coefficients of H-admissible functions) Let f{z) he a H- 
admissible function and ( = C("-) be the unique solution in the interval ]0,p[ 
of the saddlepoint equation 

^ /(c) 

Then 

[z"]/(z) = /(£)^ (1 + o(l)) . 

where b{z) = z^h"{z) + zh'{z) and h{z) = log/(z). 
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Let us record immediately an application of this result. 
Proposition 2.10 With the notation above, for p = 0, 1, 2, 

[z^]Hp{z) = ^ nP/'-'/^e^^{l + o(l)). 



Proof. For any positive integer n the saddle point Cp(^) is the least positive 

le 

[n + p + l)z^ - {2n +p + 2)z + n = 



H' ( 

solution of z f/ , , = n, that is, the least positive solution of 

Jrlp [z) 



and it follows that 



2{n+p+l) " U'/V' 
Moreover from Equation ([7]), the function bp{r) is bp{r) ~ ?'%YZ7yT^- Therefore 



[z"lF„(z) = ^ . / 2i (1 + o(l)) . 

Now = n-3(l+o(l)), (l-Cp)P = n-5(l+o(l)) and ^2^ ((p + 2) - Kp) = 

20F(l+o(l)). Finally ^ = (V^-l-f )(l+o(l)) and = exp(-7ilogCp) = 
exp(V^+£±i)(H-o(l)).''so 

„\Ai-l-p/2 

= £ ^e^+(P+i)/2nP/2-i/4(i + o(l)) 

2y'TT 



2v^ 



^p/2-l/4g2VTr(^_^_^(^))^ 



□ 



2.3.3 Expected value and standard deviation of Xn 

We now conclude the study of the expected value and the standard deviation 
of the number of sequences in a random partial injection. 

Lemma 2.11 The expected number E(X„) of sequences in a random partial 
injection of size n is asymptotically equal to ^/n with standard deviation o(y^). 

Proof. Recall that the expected values of the random variable X„ and X^^ and 
the variance of X„ are given in Equations ([U, ([5]) and © in Section !^. 3. II We 
can use Proposition 12 . 1 01 to estimate these quantities. 
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From Equation ^ wc find that 

(n - iU/4e2y^^^ 

HXn) - ^ J,,^,^ (1 + 0(1)) = V^H + 0(1)), 

that is, the expected number E(X„) of sequences in a random partial injection 
of size n is asymptotically equal to y/n. 
Similarly, from Equation ([5]) we get 

E{Xl) = + oil)) + VT^{^ + o(l)) = n(l + o(l)), 

and from Equation ([5]) ct^(X„) = o(n). Thus the standard deviation a(Xn) of 
the number of sequences in a random partial injection of size n is o(y^). □ 

2.3.4 Proof of Theorem [2761 

Before we prove Theorem 12.61 let us recall the statement of Chebyshev's in- 
equality. 

Proposition 2.12 (Chebyshev's inequality) If X is a random variable of 
expectation lE(^) with a finite variance a'^{X) then for any positive real a 

P{\X~E{X)\>a}<^^^. (8) 

As we already noted, given a pair of partial injections, a leaf in the re- 
sulting A-graph is an cndpoint of a sequence in one of the partial injections, 
and a singleton in the other. In view of Lemma 12.111 Chebyshev's inequality 
(Equation ([8])) applied to X„ and a — y/n, shows that 

P{|X„-V^|> V^} = o(l). (9) 

In other words, the probability that a partial injection of size n contains more 
than 2y/n sequences tends towards when n tends towards oo. Let us call such 
a partial injection sequence-rich. Then n-vertex labeled ^-graphs defined by 
a pair of partial injections, one of which at least is sequence-rich, occur with 
probability o(l). 

Let us now focus on the labeled A-graphs defined by partial injections, each 
of which contains less than sequences. Again, a vertex is a leaf if it is an 
endpoint of sequence for one of the injections and a singleton in the other one. 
As the two injections play symmetric roles, we estimate the probability for a 
vertex to be an endpoint in the first injection and a singleton for the second 
one: the total estimate will be bounded above by twice that probability. 

Since the first injection has at most sequences, it has at most A^/n 

endpoints. To estimate the number of second partial injections in which at least 
one of these vertices is a singleton, we count the number of partial injections 
that do not involve such a vertex. For any given vertex, there are In-i such 
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partial injections, and as we have to consider up to A^/n potential endpoints, the 
number of partial injections in which at least one of these vertices is a singleton 
is bounded above by A^/nln-i- 

Therefore there are at most S^/nln-iIn pairs of partial injections that are 
not sequence-rich and that exhibit at least a leaf. As /„_i < this number is 
less than or equal to -^In^ and the associated probability is less than or equal 

to -f=. 

Consequently the probability for an n-vertex labeled A-graph to have at 
least a leaf is less than or equal to 

0(1) + 4= = 

which concludes the proof. 



2.4 The number of size n subgroups 

Let Sn,r be the number of size n subgroups of F = F{A), where rank(F) = r. By 
Corollarv l2.71 the number of admissible labeled A-graphs of size n is /^(l-l-o(l)). 
By Lemma [T75l each size n subgroup is represented by (n— 1)! distinct admissible 
labeled A-graphs, so 

r 

Q n 

(n- 1)! 

Proposition [2TIO] gives us an equivalent of In/nl, and it follows that 

rr p — r/2 

CI ir — 1 n ir— 1 — r/4 1r\fn 



By Stirling's formula, n\ is equivalent to v27re "n"+2 and it follows that 



/27r 



3 Random generation algorithm 

As discussed in Section [TT2l and in particular in Section [1.2.3l fsee also Section [3T2l 
below), the core of our random generation algorithm for admissible A-graphs, 
is a procedure to randomly generate size n partial injections. 

The recursive decomposition of partial injections investigated so far allows us 
to use the recursive method introduced by Flajolet, Zimmermann and van Cut- 
sem (following work by Nijenhuis and Wilf [22]) to efficiently and randomly 
generate partial injections of size n. 

Recall that a partial injection is a set of disjoint components^ that are either 
cycles or non-empty sequences. The recursive method consists, in our case, in: 

• choosing the size A: of a component {k G {l,...,n}) according to the 
distribution of the sizes of components in a random size n partial injection; 
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• choosing whether that size k component is a cycle or a sequence accord- 
ing to the distribution of these two types among size k components; 

• and choosing a size n — k partial injection. 

We give more details below, on how these steps are performed. The result 
of the procedure is a sequence of symbols of the form ^i(fci) . . .^r(^r)) where 
ki + ■ ■ ■ + kr = n, the are in {a, k}, a{k) stands for sequence of size k, and 
K{k) stands for cycle of size k. Such a sequence represents, in a natural way, an 
unlabeled size n partial injection and the last step of the algorithm consists in 
randomly labeling that partial injection. 
Let us now be more precise. 



3.1 Partial injections 

The tool to grasp the distribution of the sizes of components in partial injections 
is the pointing operator 0: pointing a labeled object consists in marking one of 
its atoms, or equivalently one of its labels from {1, • • • , n}. Naturally, there are 
n ways of pointing an object of size n. So if the EGS of a labeled combinatorial 
class C is C{z) = "Y^CnZ^ /n\, then the pointed class of C, denoted by 6C, has 
EGS 

n>0 

If C is, as in our situation, defined as a set of components of a class T) with EGS 
D{z), then C{z) = exp(£)(2)) and 

@C{z) = zj-C{z) = z^^{cMD{z))) 



= z 



^D{z)\ exp(£)(0)) = QD{z) C{z). 
dz J 



The combinatorial interpretation of this equality is the following: marking an 

atom of an clement of C amounts to marking an atom of one of its components 
(of size, say, k), and the remaining part of the element of &C is a non-pointed 
element of C of size n — k. 

For partial injections, we have I{z) = exp(Z)(^)), with D{z) = + 
log(Y^). Therefore 



ei{z) = eD{z) X i{z) = [jY^ ^i^z) 

where ^^J^_^^2 is the EGS of pointed labeled sequences and is the EGS of 
pointed labeled cycles. Now 

^fe, z - f ^ if A; > 1, r fci z _ / 1 if fc > 1, 



(l-z)2 \0 iffc = 0, ^ 'l-z \0 ifA; = 0, 
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so we have 

-S = E(^ + i)7;rSv- (11) 



(n^fc)!- 



Therefore the probabiHty pk = P(size k) for the pointed component to be of size 
k is 

and the probabiHty for a size fc component to be a sequence (resp. a cycle) is 

fc 1 
P(sequence) — P(cycle) = 



k+l ^ ' k+\ 

We are now ready to describe the random generation algorithm for partial 
injections of size n. The discussion of its complexity is postponed to Section 

Let Uniform([0, 1[) be the function that returns a real number chosen uni- 
formly at random in the interval [0, 1[. Recall that if X is a random variable 
with values in |1, n] with probability f{X = i) = pi then the value of X can be 
generated randomly with respect to this probability distribution as follows (see 
[S] for example). 

RandomX 

dice = Uniform([0, 1[) 
k = l, S=pi 
while dice > 5 

k = k + l 

S = S + pk 
return k 

Our algorithm to randomly generate a partial injection of size n uses directly 
this idea. Because the probabilities discussed above (for a component of a 
random partial injection to have size k, for a size k component to be a cycle) 
are rational numbers, we choose to express the algorithm entirely in integers, 
in order to facilitate exact computation, and thereby to guarantee the absence 
of bias in the distribution of the partial injections. Concretely, we multiply dice 
and the pk by /„ . 

The algorithm requires a preliminary phase, during which a table containing 
the values of 7^ (0 < fc < n) is computed using the recurrence relation in 
Equation H]). 

We denote by Uniform (A'') a function that returns an integer chosen uni- 
formly at random in the interval [0, N[. 

RANDOmPARTIAlIn JECTION (n) 

Result — W 
while n > 

// Compute the size k of a component 
dice = Uniform (J„) 

k = 1,T = 1, S = 2I„-i // That IS, S = 7„pi 
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while dice > S 

k = k + l 

T = T*{n-k+l) 

S = S + {k+l)TI„-k // That IS, S = S + I„pk 
// Decide whether the component is a sequence or a cycle 
dice' = Uniform(A: + 1) 
if dice' < k 

then Append o-(fc) to Result 
else Append K{k) to Result 
n = n — k 
// Randomly label the final result 
Label Result with RANDOMPERMUTATlON(n) 
return Result 

The outer while loop of the algorithm produces a sequence of symbols of 
the form {^i{ki), . . . ,^r(fcr)), with each G {a, k} and such that ~ "^i 

that describes the size and the nature of the components of the size n partial 
injection. 

The last step of the algorithm, which randomly generates a permutation of 
the n elements on which the partial injection is defined, can be performed in 
linear time and space using the algorithm given in Section [51 

3.2 Admissible 74-graphs 

Recall (see Section 11.2.31) that our algorithm to generate admissible A-graphs 
consists in randomly generating, for each letter of the alphabet A, a partial 
injection of size n, and then using a rejection algorithm to keep only admissible 
A-graphs. 

Assuming that the table for the values of /„ is already computed, the algo- 
rithm reads as follows. 

Random Admissible AGRAPH(n) 
repeat 

for each a £ A 

compute the partial injection la for a using RANDOMPAR,TlALlNJECTlON(n) 
until the resulting A-graph is admissible. 



3.3 Complexity 

The algorithm requires manipulating large integers: /„ is of the order of 0(n"). 
Computing in multiprecision, that is, without any approximation, guarantees 
the absence of bias in the distribution of the generated objects. However, we 
also briefly discuss floating point implementations at the end of this section. 

We first evaluate the complexity of the algorithm in the RAM model, i.e., 
under the unit cost assumption (or uniform cost convention) according to which 
each data element (here: each integer) is stored in one unit of space, and the 
elementary operations (reading, writing, comparing, performing arithmetic op- 
erations, etc) require one unit of time, - even for large numbers. 
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The pre-computation phase that stores the values of uses the recurrence 
relation on the J„ given in Equation ([1]), and it requires 0{n) operations. 

The (worst case) time complexity T(n) of RANDOMPARTiALlNJECTiON(n) 
(assuming that the values of the /„ are precomputed) satisfies the following 
inequality 

T{n) < max {ck + Tin — k)) 

l<k<n 

(where c is a constant), so that T{n) < cn, that is, T{n) — 0{n). 

Next, checking whether the A-graph generated is connected can be done 
using common algorithms on graphs (depth- first search) in time 0(n). Checking 
1-trimness is also done in 0{n), by scanning the list of edges which has at most 
\A\ n elements. This part of the algorithm does not require manipulating large 
numbers. 

Corollarv l2 . 71 shows that size n A-graphs are admissible with probability 1 + 
o(l). Therefore the number of rejects (see Section [T2T3]) for lack of admissibility 
is equal in average to = 1 + o(l). Thus the average number of rejects 

tends to when n tends to infinity. 

In conclusion, for the RAM model, the random generation algorithm requires 
a precomputation that can be done in linear time, and it uses, in average, 0{n) 
operations to generate each admissible A-graph. 

In the case of the bit complexity (or logarithmic cost convention), an integer 
N is handled via its binary representation, of length ©(log TV). In particular, 
the representation of /„ has length 0{n\ogn). The basic operations on numbers 
of that size (reading, writing, comparison, addition, multiplication by a number 
whose binary representation is of length 0{n)) are performed in time 0{n log n), 
whereas the multiplication of two such numbers takes time O{nlog^ n). Under 
this bit-cost assumption, the time and space required for the pre-computation 
are 0{n'^ log n) (instead of 0{n) in the RAM model). And each random draw 
takes time ©(n^ log^ n). 

Remark 3.1 Under the bit-cost assumption, we should also take into consider- 
ation the complexity of the function Uniform(A^). Since this function returns 
an integer, it can be performed by a rejection algorithm, randomly choosing 
each bit of (the binary expansion of) N, that is, in 1 -I- [lognj unit cost oper- 
ations. In such a process, the probability that the integer generated is greater 
than N is at most 1/2 (^^ in base b), so the average number of reject is at 
most 2, and the complexity of Uniform(A^) is ©(logiV). □ 

In practice, it is often convenient to use floating point arithmetic instead of 
multiprecision arithmetic. In theory, the approximations made in floating point 
arithmetic induce a loss of precision, and may therefore introduce a bias in the 
probability distribution of the generated objects. 

Denise and Zimmermann [5] showed that the complexity of the floating point 
implementation is the same as for the RAM model. They also show that, if 
certain precautions are taken (essentially in the choice of the rounding operator 
for each operation), a floating point implementation introduces only a negligible 
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bias in the probability distribution of the generated objects. In the case of partial 
injections, using the standard rounding operator does not seem experimentally 
to produce a significant bias, but it is not theoretically proved. 



4 On the rank of a size n subgroup 

We conclude this paper with a few applications of the above results to the study 
of the rank distribution of size n subgroups. The first one concerns the expected 
value of this rank, and the others establish the intuitive results that finite index 
(resp. fixed rank k) subgroups are asymptotically negligible. 

Recall that the rank of a subgroup H with a size n graphical representation 
r is equal to |i?(r)| — n + 1, where |i?(r)| is the number of edges of the graph 
r (see Section [T7T|) . 

Corollary 4.1 The average rank of a size n subgroup of F{A) is {\A\ — l)n — 
\A\-\/n + I, with standard deviation o(^/n). 

Proof. Let F an A-graph. For each letter a of the alphabet A, the number of 
o-labeled edges is the difference between n and the number of sequences in the 
functional graph of the partial injection determined by the a-labeled edges. In 
view of Lemma 12.111 the number of a-labeled edges is therefore asymptotically 
equal to n — ^/n, with standard deviation o(^/n), and the announced result 
follows. □ 



Corollary 4.2 Let H be a size n subgroup of F{A) and let k > 1 be an integer. 
Then the probability that rank{H) < k is asymptotically o(^). 



Proof. CoroUarv 14.11 shows that the mean value of the rank of H is E(rank) = 
{\A\ — l)n — \A\^/n + 1, with variance (T^(rank) = o{n). 

If rank(iJ) < k, then in particular |rank(7f) — E(rank)| > E(rank) — k. It 
follows, by Chebyshev's inequality (see Equation ([5]) above), that 

P{rank<fc} < P{|rank - E(rank)| > E(rank) - fc} 
o(n) 1 
C(n^) n 

□ 



5 Finite index subgroups 

We saw in Section [1.11 that a finitely generated subgroup has finite index if and 
only if, in its graphical representation, every letter labels a permutation, that 
is, a partial injection whose domain is the full set of vertices. Let us say, in 
that case, that the corresponding A-graph is a permutation A-graph. Based on 
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this observation, we can adapt our approach to get a hnear time (in average) 
random generation algorithm. 

As in the general case (see Section 13. 2p , we use a rejection algorithm: we 
repeatedly randomly generate a permutation of size n for each letter a g A, 
until the resulting graph is connected and 1-trim. 

Random generation of permutations is a classical object of study, and it 
can be performed in time 0{n) (in the RAM model, 0{n\ogn) in the bit-cost 
model) using the following algorithm (see [S] for example): 

RANDOMPERMUTATION(n) 

for i e {1, • • • , n} 

V[i\ = i 
for i from 2 to n 

j = \ + UNlFORM(i) // j is a random integer in 

Swap P\i] and P[j] 
return P 

Note that this algorithm does not require manipulating large integers. 

The efficiency of the rejection algorithm depends on the average number of 
rejects, and hence on the probability, for a permutation ^-graph to be connected 
and 1-trim. Trimness is a moot point since a permutation A-graph never has 
any leaf. 

Connectedness is not guaranteed, but we note that Dixon [6] uses Bender's 
theorem fTheorem l2.5l above) to compute the asymptotic expansion of the prob- 
ability for a pair (or a r-tuple) of size n permutations to generate a transitive 
subgroup of §„, that is, to define a connected permutation A-graph. He shows 
in particular that this probability is of the form 1 — l/n^~^ + 0(l/n^^''~^''). 

Thus the average number of rejects tends to when n tends to infinity, 
and the average case complexity of the random generation of an admissible 
permutation A-graph is 0{n) (in the RAM model, 0{n\ogn) in the bitcost 
model). 

We can also show that finite index subgroups are asymptotically negligible 
among subgroups of a given size. 

Proposition 5.1 The probability for a randomly chosen size n subgroup of 
F{A) to have finite index is C'(n''/*e"^'"^). In particular, it is a {n'^^^ for 
any k >1. 

Proof. Let r — rank(F) = \A\. The number of size n finite index subgroups 
is at most the number of r-tuples of permutations, namely nl'' . The number of 
size n subgroups is, according to the discussion in this paper, equivalent to the 
number of r-tuples of partial injections, that is, it is equal to /^(l -I- o(l)). 

Thus the probability that a size n subgroup has finite index is at most equal 
to ^-T") (1 + o(l))- By Proposition 12.101 (applied with p — 0), we know that 
I,Jnl = ^^(n^i/^e^V^). Therefore 

= O (n'l^e-^^^^ , 
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which converges to faster than the inverse of any polynomial. 



□ 



6 A few questions 

A first question, prompted by Proposition 15.11 is the following. Even though 
finite index subgroups are negligible among finitely generated subgroups, we 
saw in Section [5] how to randomly generate them. When k is fixed, rank k 
subgroups are also asymptotically negligible among finitely generated subgroups 
(Corollarv l4.2p . Can we find an efficient random generation algorithm for these 
subgroups? 

Our second question is related with another method used in the literature to 
generate subgroups (not only for free groups). This method is based on the idea 
of randomly generating a fc-tuple of elements that generate the subgroup — with 
k fixed and, say, the maximal length of the generators being allowed to tend 
to infinity, ft is used for instance by Jutsikawa |14, to study the distribution 
of malnormal subgroups in free groups. We refer also to Martino, Turner and 
Ventura |19| on the distribution of monomorphisms between free groups, and 
to Miasnikov and Ushakov [20] for a survey of this technique in relation with 
group-based cryptography. 

The question that arises in this context is to compare the distribution of 
subgroups that occurs with this generation scheme and the distribution we 
considered in this paper. They must be different since we fix the size of the 
subgroups generated, whereas they fix the number and the maximal length of 
a set of generators, which may lead to graphical representations of varying size. 
One must also take into consideration the fact that each subgroup is generated 
by a potentially large number of fc-tuples of generators. However, it remains 
possible that generic properties (those that have asymptotically probability 1) 
coincide for both distributions. 
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